Internal protein dynamics shifts the distance to the mechanical transition state 
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Mechanical unfolding of polyproteins by force spectroscopy provides valuable insight into their 
free energy landscapes. Most phenomenological models of the unfolding process are two-state and/or 
one dimensional, with the details of the protein and its dynamics often subsumed into a zero-force 
unfolding rate and a single distance x^P to the transition state. We consider the entire phase space 
of a model protein under a constant force, and show that the distance a;* D contains a sizeable 
contribution from exploring the full multidimensional energy landscape. Proteins with more degrees 
of freedom are expected to have larger values for :r* D . We show that externally attached flexible 
linkers also contribute to the measured unfolding characteristics. 
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Atomic force microscopy or optical tweezers are now 
routinely used to study the mechanical properties of pro- 
teins An important issues is the unfolding behavior 
of folded domains, including the strength, and the depen- 
dence on fold topology 0] and secondary structure [jj, 0] • 
The simplest description of unfolding treats the unfolding 
domain as moving in a one dimensional potential G{x), 
where the reaction coordinate x is assumed to be directly 
coupled to the applied force. In perhaps the simplest ap- 
proximation, the rate of unfolding k u (F) of a two-state 
(native and denatured) protein under force can then be 
calculated using Bell's formula based on Kramers' rela- 
tion for escape from a well P, Q : 



k u (F) ~ fc°exp 



ID 



V k B T 



(1) 



where ks is Boltzmann's constant, T is the absolute tem- 
perature, x^P is the transition state displacement along 
the force projection, and is the unfolding rate con- 
stant at zero force. This result can be used to calculate 
the distributions of unfolding times (for applied force) 
or unfolding forces (for applied pulling speed), and has 
been applied to many different proteins j(| 0, @, 0, E3] • 
This simple result has been corrected to use the entire 
shape of G(x) rather than just the barrier height and 
displacement x* D and incorporate the cantilever compli- 
ance [HI, EH , to relax the diffusive limit to faster speeds 
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[131 ] , and to allow for multiple pathways or states 
There has been extensive work on the utility of a the x u 
and the simple ID picture, linear force dependence, etc 
16, 17] [More work to do here!!!] 

However, the assumption of a one dimensional reac- 
tion coordinate grossly simplifies physical reality. The 
unfolding rate depends dramatically on pulling direction 
Tel 1^ ] , and hence on the multidimensional nature of the 
free energy landscape. Moreover, the one dimensional 
parameters have no satisfactory physical interpretation: 
is defined along the pulling direction, while the ac- 



tual unfolding takes place along an unknown reaction co- 
ordinate^), presumably involving a few key amino acids. 
While x}P ~ 0.2 nm is the right order of magnitude for 
hydrogen bonds, the explicit connection with molecular 
configurations remains unclear. In this Letter we explore 
the role of the multidimensional energy landscape in de- 
termining the effective ID unfolding parameters. The key 
physical ingredient is that an applied force perturbs fluc- 
tuations transverse to the forcing direction, because of 
the highly anharmonic nature of angular bonds. Specif- 
ically, force restricts a protein's conformational search 
among the dihedral states of the polypeptide backbone. 
We calculate this using molecular dynamics (MD) sim- 
ulations of a simple off-lattice Go model [13, HH for a 




FIG. 1: The free energy surface of a two-state system, with 
native (N) and unfolded (U) minima. An external force F 
parallel to Xy lowers the barrier to unfolding by Fx u - 
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topologically simple protein, and show that this restric- 
tion leads to a sizeable contribution to a;^ D . 

Escape from a multidimensional energy landscape — 
The rate of escape from an A/"-dimensional energy land- 
scape under an applied force F (Fig. [T]) is given by [22l | 



k u (F) =r(F)exp 

r(F) = 



( Fx u 

\k B T 

G'tT\ nf=i ^G' N HF) 



(2) 
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where x u is the distance to the transition state, 7 is a 
friction coefficient, and AGts-n is the height of the 
free energy barrier relative to the native basin. G T RC 
is the curvature in the unstable direction at the transi- 
tion state, G T ? are the N — 1 stable curvatures at the 
r N '(F) are the TV positive curva- 



T TS 

transition state, and G» fe ' 



tures about the native basin under an applied force. If 
the transition state is sharp i.e., \G C RC \ ^> F/x u , the 



curvatures G TS at the transition state, as well as x u 



are 



approximately independent of force. A more physical 
representation of the attempt frequency T(F) follows by 
relating a curvature G k to the associated fluctuations 
by (5x1 ) = k B T/G" k . This yields 



r(F) = 



k B T V TS 
2tt 7 ^ V n (F) 



exp 



-AGts- 



N 



(4) 



where 1^9 is the width of the transition state along the 
unfolding reaction coordinate, Vts is the volume of phase 
space available for fluctuations at the transition state and 
Vn{F) is the corresponding volume in the native basin. 

In one dimension the weak force dependence of the 
prefactor T(F) can be safely ignored [23[. A weak per- 
turbation of the native basin volume Vjv(F) leads to 



K(F) ~ k° u 



exp 



F \x u + Xk B T] 



k B T 



(5) 



where A 



d[lnV(F)]/dF\ 



F=0- 



Thus, any change in 



the volume of the native basin with force will shift x}P 
in the equivalent one dimensional model, 



„1D 



Afc„T = 



Sx,, 



(6) 



where Sx u is a dynamical, or entropic, contribution to the 
transition state displacement. If the volumes associated 
with different degrees of freedom randomly increased or 
decreased with an applied force, there would be little 
effect. However, we expect the volume of most perturbed 
degrees of freedom to decrease under an applied force, 
so that Sx u is proportional to the number of perturbed 
degrees of freedom. For a force-independent volume we 
recover Eq. [TJ with Sx u = 0. 



Calculation of phase space volumes — Phase space fluc- 
tuation volumes were calculated from MD simulation tra- 
jectories. MD was performed for protein L (PDB refer- 
ence: 1HZ6 [H) using the C Q Go model of Refs. HIH- 
The simulation protocol is described in detail in [3J. A 
real protein does not fluctuate about a single well-defined 
average structure; because dihedral angles typically ac- 
cess discrete values, the accessible states are fluctua- 
tions about many well defined structures, or nodes in 
phase space. Fig. [2] shows the phase space explored by a 
tetramer with 2 unimodal and 2 bimodal dihedral distri- 
butions. 

The total unfolding rate k i ^ >t (F) is the weighted sum 
of the escape rates fcg (F) from all nodes [3 = 1...M 
(assuming AGts-n and x u are the same for all nodes); 



kT(F) 



k B T 



Vts 



2njl^V^(F) 



■ exp 



(AG TS -n — Fx u 



k B T 
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E 



PP(F) 



(7) 
(8) 



where V$(F) is the volume and P@(F) the occupation 
probability of node [3. The quantity V^ S (F) is the effec- 
tive phase space volume of the native basin. 



(a)' p 


1 1 I 1 1 
3,1 P 


' I ' I 

3,2 

\ k=3 

a \ 

— -i ^-f- L 


P 

J *C J 


4,1 P 

\ / 


1,2 

\ k=4 

1 , ^-t- 



• t 



-2 -1 

£i (radians) 



FIG. 2: The phase space of an oligomer with 4 dihedral an- 
gles: two of these are unimodal (k — 1,2, not shown) and 
two are bimodal [k = 3,4). (a) The probability distribu- 
tion function pk,tata\ for each bi-modal dihedral angle (black 
solid line) can be resolved into separate distributions Pk, n (red 
dashed lines) about well defined averages (green vertical dot- 
ted lines) . There are four possible structures corresponding to 
fluctuations around {^3,1,04,1}, {03,1,^4,2}, {^3, 2, H^i} and 
{$13,2,^4,2} (b) The phase space projected onto {0.3,0.4}. 



The occupation probability P f3 (F) of each node is 

/JV-3 



P P {F) 



Pk,n^k{t)) 
^ Pk,total(^k(*)) 



n 



(9) 



where Y,p=i P P {F) = 1 and (. . .) is the average over the 
MD trajectory. A node is specified by a particular set of 
occupancies of each dihedral angle $1 = (f2i, . . . Ojv-3), 
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where N is the number of atoms. Each term in the 
product is the normalised probability that, in node (3, 
a given dihedral angle J\. participates in its nth dihedral 
state (peak) (Fig. For large numbers of nodes M a 
mean field approach, in which all nodes are assumed to be 
equally populated, works well when states are sufficiently 
uncorrelated in time, as in this case [26[. 

The volume V^-(F) of fluctuations about each node /? 
is given by Vg(F) = VdetC 5 , where d/ = (SnSr^p 
is the covariance matrix for fluctuations Sri in each C a 
position Yi. Here the angle brackets denote an average 
within node /?. We calculate C@ by transforming coordi- 
nates to bond lengths, bond angles, and dihedrals angles. 
We ignore correlations between bond and dihedral angles, 
which is an excellent approximation here [261 ] - Hence, the 
effective volume of phase space is given by 



1 



1 



Vf(F) 



Ve(F) £j V g(F) 



(10) 



where Vg(F) and Vq(F) are the volumes of phase space 
explored by the bond and dihedral angles respectively. 
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3: (a) Effective phase space volume Vjy (F) for protein L 

\nVf(0)- 



FIG 

(o). Dashed line is a fit to In Vf (F) ~ In (0)-Sx u F/k B T, 
which yields the dynamic contribution to the transition state 
placement 8x u = 0.054 ± 0.008 nm (Eq. [6j . (b) Average un- 
folding times for protein L using MD at T = 300 K, with ni 
attached glycine linkers (o: m = 0, = 0.191±0.004 nm, □: 
n t = 16, xl D = 0.241 ± 0.004 nm, o: m = 32, a;™ = 0.267 ± 
0.004 nm). The linear fits yield logr = A - x™F/(k B T). 
Error bars are of order the symbol size. 



number of multi-modal dihedral peaks. The latter ef- 
fect dominates, since the loss of a single dihedral peak 
immediately removes many nodes of phase space. Sim- 
ulations of the unfolding of the same protein L domain 
(Fig. [3Jd) yield an effective ID transition displacement 
x}P = 0.191 ± 0.004 nm, from measuring an exponential 
dependence of the unfolding time t u on applied force, 
t u ~ e~ Fx ^ /( feflT ), as predicted from a single reaction 
coordinate description, Eq. (JTJ. Hence we conclude that 
the bare transition state position was x u = 0.137nm, and 
the large shift of 5x u = 0.054 ± 0.008 nm is between 34% 
and 45%. 

Linker Effects — For convenience, protein domains 
are often pulled with long linkers, or unfolded protein 
strands. The linkers also fluctuate about discrete di- 
hedral states when stretched. The "lumpincss" of this 
phase space is irrelevant for weakly stretched strands, but 
dominates the response for strongly stretched strands. 
Since force is coupled to the folded domain through the 
linkers, the total available phase space is the product of 
protein and linker phase spaces, and the measured x^P 
depends on the restriction of the linkers' phase space. 
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FIG. 4: (a) Volume of phase space calculated using the mean 
field method and (b) the dynamic contribution to the distance 
to the transition state 8x u = —kBTd^nVff 1 ]/ dF , for strands 
of flexible glycine linkers. A constant force was applied for a 
total time of 1 fis at T=300K. 



Results — Fig. [3^, shows the phase space volume as a 
function of force calculated from MD simulations of pro- 
tein L. The dynamic contribution to the transition state 
placement is Sx u = 0.054 ± 0.008 nm. The reduction 
of phase space volume comes from (1) the narrowing of 
the dihedral distributions, and (2) the reduction in the 



To test this, homogeneous linkers strands were con- 
structed from a dihedral potential based on glycine. 
Fig. |4] shows the normalised effective phase space volume 
VJy ■(F)/V^(Q) and the corresponding Sx u as a function 
of force for different number of atoms ni per linker. The 
effect is greater for longer linkers, since more nodes are 
available to remove. We have ignored any force depen- 
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dence of the volume of the transition state Vts (Eq. [4]). 
Although we cannot easily characterize the (unstable) 
transition state, we can compare the shifts 5x u measured 
directly from the unfolding times (Fig. [3]) with predic- 
tions from the phase space volumes (Fig. 01 . The dif- 
ference 5x u {ni = 32)— 6x u (ni = 16) ~ 0.03 — 0.05 nm 
from the calculation of phase space volumes (at forces of 
order 200 — 300 pN) agrees with the difference x u (n; — 
32)— x u (ni = 16) = 0.026 nm measured from pulling sim- 
ulations. This gives us confidence that for this model 
of protein L the transition state is sharp and its volume 
does not change appreciably under an applied force. 

Discussion — We have shown that an externally ap- 
plied force restricts a fluctuating protein's accessible 
phase space volume, which increases the transition state 
displacement in the equivalent one dimensional two- 
state model. This contribution can be appreciable be- 
cause of the many degrees of freedom in a protein. Larger 
proteins have a potentially larger x}P, depending on 
which degrees of freedom couple to the applied force; 
this will depend critically on the topology of the fold and 
the direction in which it is pulled. Most importantly, 
we predict that should be greater for proteins un- 
folded through longer attached linker strands. This may 
have biological significance; e.g. the long unfolded PEVK 
regions in titin [27| may play help modify the unfold- 
ing characteristics of titin. Finally, we note that many 
experiments have unfolded concatamers of multiple do- 
mains, for convenience of attachment and to generate 
larger statistics @, M, M, M, E3, S, M, H El . We sur- 
mise that in all of these cases the dynamic contribution 
to x^P was significant, and also included a contribution 
from already unfolded domains, which act as "linkers" 
for the last few domains to unfold in a given pull. 
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